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1. Summary 



Apicomplexan protists such as Plasmodium and Toxoplasma contain a mito- 
chondrion and a relic plastid (apicoplast) that are sites of protein translation. 
Although there is emerging interest in the partitioning and function of translation 
factors that participate in apicoplast and mitochondrial peptide synthesis, the com- 
position of organellar ribosomes remains to be elucidated. We carried out an 
analysis of the complement of core ribosomal protein subunits that are encoded 
by either the parasite organellar or nuclear genomes, accompanied by a survey of 
ribosome assembly factors for the apicoplast and mitochondrion. A cross-species 
comparison with other apicomplexan, algal and diatom species revealed com- 
positional differences in apicomplexan organelle ribosomes and identified 
considerable reduction and divergence with ribosomes of bacteria or characterized 
organelle ribosomes from other organisms. We assembled structural models of sec- 
tions of Plasmodium falciparum organellar ribosomes and predicted interactions with 
translation inhibitory antibiotics. Differences in predicted drug-ribosome inter- 
actions with some of the modelled structures suggested specificity of inhibition 
between the apicoplast and mitochondrion. Our results indicate that Plasmodium 
and Toxoplasma organellar ribosomes have a unique composition, resulting from 
the loss of several large and small subunit proteins accompanied by significant 
sequence and size divergences in parasite orthologues of ribosomal proteins. 



2. Introduction 

Plasmodium parasites have three genomes [1]: a 23 Mb nuclear genome distributed 
on 14 linear chromosomes [2], a 35 kb circular genome found in the relic plastid 
(the apicoplast) [3] and a 6 kb linear genome in the mitochondrion [4,5]. Each 
of these genomes is transcribed by its own apparatus [6-8] and each compart- 
ment possesses a suite of unique ribosomes for its translation [9-11]. Recent 
reports have provided insights into the partitioning, function and antibiotic 
interactions of organellar translation factors in Plasmodium spp. [12-16]. 

Eukaryotic ribosomes consist of one large (60S) and one small (40S) subunit 
which come together during translation to form an 80S particle. By contrast, 
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ribosomes of bacterial origin consist of a large (50S) and small 
(30S) subunit that assemble to form a 70S particle. Consistent 
with their endosymbiotic origins, the apicoplast and mito- 
chondria contain 70S ribosomes that are distinguishable in 
size (around 20 nm) from the 80S eukaryotic-type ribosomes 
(around 25-30 nm) found in the cytosol and rough endoplas- 
mic reticulum (ER) [9,17,18]. In addition to ultrastructural 
characterization, early sequencing of organellar DNA 
revealed bacterial-type rRNA molecules on the mitochondrial 
and apicoplast genomes [19,20], although the unexpected 
presence of the apicoplast understandably gave rise to con- 
fusion between apicoplast and mitochondrial DNA in some 
of the earliest analyses [21]. Further sequencing of the 35 kb 
apicoplast genome revealed the presence of a complete set 
of rRNAs as well as a cluster of ribosomal proteins of clear 
plastid and bacterial origins [3]. Complete sequencing of 
the 6 kb mitochondrial genome revealed a collection of frag- 
mented rRNA molecules, but no ribosomal proteins [19,22]. 

Initial analysis of sequenced Plasmodium nuclear DNA frag- 
ments and expressed sequence tags (ESTs), then later assembly 
of the entire Plasmodium falciparum nuclear genome, revealed 
many more ribosomal proteins with apicoplast and mitochon- 
drial targeting sequences [2,23] that are post-translationally 
processed for targeting to organelles. The subsequent sequen- 
cing of organellar and nuclear genomes from a large number 
of other apicomplexans has expanded our picture of ribosomal 
and other translation components in organelles. Here, we 
attempt to clarify the complement of the core protein trans- 
lation components by performing a cross-species survey of 
ribosomal proteins and ribosome assembly factors required 
for organellar translation in apicomplexans. 

Our survey identifies considerable divergence between 
the organellar ribosomes of apicomplexan parasites and the 
ribosomes characterized in bacteria or other endosymbiotic 
organelles. In addition to very significant sequence and size 
divergences in identified orthologues of ribosomal proteins, 
several ribosomal proteins are either missing or sufficiently 
divergent to be unrecognizable. Within the phylum, we 
also detect several differences in ribosomal protein compo- 
sition, both in those encoded by apicoplast genomes and 
those found in the nucleus. 

Using the conserved ribosomal proteins and rRNA species 
identified, we have assembled structural models of the sections 
of the apicoplast and mitochondrial ribosomes to predict 
interactions of those ribosomes with parasite-killing drugs pre- 
dicted to bind to bacterial ribosomes. We find considerable 
differences in these predicted drug-ligand interactions, with 
several of the modelled structures suggesting specificity of 
inhibition between apicoplast and mitochondrial ribosomes. 

3. Results and discussion 

3.1. Compositional analysis of apicoplast and 

mitochondrial ribosomes of Plasmodium falciparum 

We conducted a survey of available sequences of apicomplexan 
apicoplast genomes, comparing ribosomal proteins encoded 
by different species. A list of ribosomal proteins was first 
assembled, based particularly on the well-annotated nuclear 
and organellar genomes of the red alga Cyanidioschyzon merolae 
[24-26] and the diatom Thalassiosira pseudonana [27,28]. We 
used several search strategies — genome projects were 



interrogated by text searches to find all annotated ribosomal pro- 
teins, and these were manually examined, gene models and 
predicted proteins were subject to blastp searches, whereas 
genome nucleotide data were subjected to tblastn searches. 
The OrthoMCL database of orthology groups [29] was also 
searched to find relevant homologues of ribosomal proteins. 

3.2. Organellar genomes 

These searches revealed several ribosomal proteins on the 
apicomplexan organellar genomes that had previously been 
missed as open reading frames (ORFs), or annotated as 
hypothetical ORFs. The 50S ribosomal protein Lll had pre- 
viously been annotated on the Toxoplasma gondii apicoplast 
genome [30] but the syntenic protein on the P. falciparum 
genome had been hitherto annotated as orfl29 [3]. This protein 
can now be assigned as the missing 50S Lll (table 1; gene IDs 
detailed in the electronic supplementary material, table SI). 
Similarly, the apicoplast 30S ribosomal proteins S4, S7 and 
S19 and the 50S proteins L4 and L36 had previously not been 
annotated in Babesia — we found ORFs on the B. bovis apicoplast 
genome that correspond to S19 and L36 at similar positions as 
on the P. falciparum apicoplast genome (table 1; electronic 
supplementary material, table SI). 

Several differences are seen between the apicoplast riboso- 
mal complements of apicomplexan parasites. Of those 
proteins encoded by the apicoplast genome itself, the 50S 
protein L23 is present in Plasmodium, but absent from the 
Toxoplasma apicoplast genome, and undetectable in its nuclear 
genome. Rpl23 has also been previously noted as missing 
from the Eimeria apicoplast genome [31] and is not apparent 
in other apicomplexan genomes (table 1). This protein, thought 
to be involved in chaperone docking, is non-essential for growth 
in Bacillus subtilis [32], and eukaryotes, eubacteria and archaea 
have divergent ribosomal structures around the L23 site [33] 
so its absence in some apicomplexan parasites is plausible. 
Some chloroplast genomes lack L23; and in spinach, the 
role of L23 has been postulated to be replaced by chloroplast 
targeting of a eukaryotic 60S type L23a/L25 [34]. However, 
no N-terminal targeting sequences are apparent on the 
corresponding Toxoplasma genes. 

Another difference between apicoplast genomes within 
Apicomplexa is the presence or absence of the ribosomal 
protein SI 7 (table 1). Plasmodium, Toxoplasma and Eimeria api- 
coplast genomes carry this gene, but it appears to have been 
lost from the apicoplast genomes of the piroplasm parasites 
Theileria and Babesia. We found no evidence for transfer of 
this apicoplast gene to the nucleus in these parasites (though 
mitochondrial SI 7 representatives are present), but SI 7 is 
small and relatively poorly conserved at a primary sequence 
level, so may simply be undetectable in the order Piroplasmida. 

3.3. Missing large subunit ribosomal proteins 

A number of large subunit (LSU) organellar ribosomal proteins 
appear to have been lost altogether from apicomplexan genomes. 
A striking apparent absence in Apicomplexa is the organellar 
Rpl5. L5 is a 5S rRNA-binding protein that is essential for assem- 
bly of the 50S central protuberance in bacteria [35], most of which 
is clearly retained in apicomplexan ribosomes; however, L5 is 
missing from mammalian mitochondrial ribosomes [36], so is a 
plausible absence from apicomplexan organellar ribosomes as 
well. Another 50S ribosomal protein that binds the 5S rRNA, 



Table 1. Plastid and mitochondrial ribosome large subunit (LSU) and small subunit (SSU) proteins identified for apicomplexan parasites (P. falciparum, I gondii, 
B. bovis, I parva and £ tenella), red alga (C. merolae), green alga (C reinhardtii) and diatom {T. pseudonana). #, assigned by sequence similarity or by 
excluding other organellar counterpart, but targeting leader is non-obvious; $, contains an internal stop codon that may be suppressed. The L7-L12 dimer in 
eukaryotes is referred to as L8, but L7 and L12 are represented by a single gene in bacteria and organelles. Ticks in black correspond to nuclear-encoded 
proteins, ticks in red correspond to mitochondrial-encoded proteins and ticks in green correspond to plastid-encoded proteins. Crosses on grey background 
correspond to proteins for which a comprehensive search was performed on organellar and nuclear genomes and failed to detect any orthologue. Only 
plastid-encoded ribosomal proteins are listed for the Apicomplexans Babesia, Theileria (Piroplasmida) and Eimeria (Coccidia). 
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L25, is missing from other plastid and mitochondrial ribosomes 
[36,37] and is also absent from apicomplexan genomes. 

The 50S ribosomal protein RpllO is an apicoplast-targeted 
protein readily detected in Plasmodium, Babesia and Theileria, 
but not in Toxoplasma. L10 is relatively large and well conserved 
with clear apicoplast-targeted orthologues in Plasmodium spp. 
but no equivalent is obvious in Toxoplasma. Mitochondrial L10 
homologues are not obvious for any apicomplexan species. In 
some other organelles, L10 appears to have been replaced by a 
nuclear L10 [38], but no L10 with an N-terminal targeting 
sequence is apparent in Toxoplasma. 

The LI 9 and L20 proteins have orthologues in Plasmodium 
and Toxoplasma with probable mitochondrial targeting 
sequences, but we found no orthologues of these with apico- 
plast targeting leaders (table 1 and figure 1). These are small 
proteins (approx. 120 amino acids each) and may hence 
be missed in a sequence similarity search. Several other 50S 
ribosomal proteins — L30, L31, L32 and L34 — have mixed dis- 
tributions in other organellar ribosomes [37,39,40], and we 
found no apicoplast or mitochondrial representatives of any 
of these proteins in apicomplexans. 

3.4. Missing small subunit ribosomal proteins 

Compared to the 50S subunit, the 30S subunit retains propor- 
tionally more members in the apicoplast genome rather than 
transfers to the nuclear genome (table 1; gene IDs detailed in 
the electronic supplementary material, table SI). Several proteins 
are also missing or undetected among the 30S proteins of the 



mitochondria and apicoplast. The mitochondrion in particular 
appears to be missing a large number of subunits, and we 
were unable to find mitochondrial targeted orthologues of SI, 
S2, S3, S4, S7, S10, S13, S19, S20 or S21. Most of these are retained 
on the mitochondrial genomes of diatoms (table 1) and are 
widely conserved among other organellar ribosomes, so their 
complete absence in apicomplexan mitochondria is unexpected 
and not easily explained. One possibility is that the mitochon- 
drial ribosomes employ prokaryotic subunits encoded by the 
apicoplast (though no mechanism is obvious for this) or may 
use proteins dually targeted to the mitochondrion and apico- 
plast. Several apparently mitochondrial targeted proteins are 
annotated as 30S ribosomal proteins in apicomplexans, includ- 
ing S22, S29 and S35. These are not widespread members of 
mitochondrial ribosomes, so their presence here may be linked 
to the possible absence of other canonical members. 

Several 30S proteins are also apparently lacking in the api- 
coplast ribosomes. Despite the presence of clear orthologues in 
red algal and diatom organelles (table 1), no apicoplast (or 
mitochondrial) S13 ribosomal proteins are apparent in any api- 
complexan species. This protein interacts with the 50S subunit 
and the P-site tRNA during translocation [41] and is essential 
for translation in other bacterial ribosomes [42], so its apparent 
absence is puzzling [42]. 

Apicomplexan parasites also appear to have lost their 
apicoplast version of S16, which is encoded on the plastid gen- 
omes of diatoms and of red and green alga, but have retained 
mitochondrial targeted SI 6 proteins (table 1 and figure 1). In 
bacteria, SI 6 is essential and plays a central role in 30S ribosomal 
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assembly [43]. SI 6 is dual targeted between mitochondria and 
chloroplasts in many plants [44] but we see no evidence for 
the presence of a possible apicoplast leader upstream of 
the mitochondrial S16 in Plasmodium or Toxoplasma. Another 
30S protein, SI 8 is absent from all the apicomplexan apicoplasts 
we surveyed, though mitochondrial SI 8 s are present (table 1). 
This protein has no obvious orthologue in archaeal or eukaryotic 
ribosomes [45], although it is essential in tobacco plastids [46]. 

S20 and S21 are also missing from the apicomplexans we sur- 
veyed. S20 is not essential in Salmonella [47], knockout of S21 
impairs but does not ablate translation in other plastids [48], 
and neither is essential in B. subtilis [32] so these are plausible 
absences from the organellar ribosomes of Apicomplexa. 

3.5. Ribosome assembly proteins for the apicoplast 
and mitochondrion 

Ribosome biogenesis involves multiple steps of ribosomal RNA 
(rRNA) processing and association of rRNA with ribosomal 
proteins [49-51]. As with any complex RNA molecule, the 
rRNA in parasite organelles is prone to the formation of numer- 
ous local non-native secondary structures. A set of cofactors 
known as ribosomal biogenesis/ assembly factors prevents for- 
mation of these stable, misfolded regions in the rRNA and 
promotes ribosome assembly [50]. These factors serve as 
check points during the assembly process where they mediate 
proper rRNA folding and protein- RNA interactions by creat- 
ing specific nucleotide modifications in rRNA or by acting as 
RNA /protein chaperones. This ultimately results in the assem- 
bly of mature ribosomal subunits. We performed an extensive 



search for putative ribosomal biogenesis factors targeted to 
the apicoplast and mitochondrion in the PlasmoDB genome 
database using current annotations as well as new assignments 
based on targeting prediction algorithms. 

Ribosome assembly factors belong to the following broad 
categories — GTPases, chaperones /maturation factors and 
DEAD-box proteins [52]. GTP hydrolysis by proteins of the 
GTPase superclass is involved at different stages of ribosome 
biogenesis mediating subunit assembly. Era, Der, Obg and 
YihA are known to interact with either the mature subunits 
or the 70S ribosome while YlqF also exhibits interaction with 
a ribosomal subunit intermediate [53]. Sequence analysis indi- 
cates the presence of multiple P-loop GTPases in P. falciparum 
that contain highly conserved motifs (table 2). The Der protein 
is conserved among eubacteria but not in archaea or eukar- 
yotes [52]; two Der homologues, with predicted targeting to 
the apicoplast and mitochondrion, respectively, could be ident- 
ified. A search for homologues of organellar Era and YihA 
proteins yielded putative candidates with mitochondrial local- 
ization while the single YlqF homologue had apicoplast 
targeting elements. Two candidates were found for Obg, one 
of which was predicted to be mitochondrial while the other 
appears to be targeted to the apicoplast. 

Chaperones assist in proper folding /unfolding and 
assembly/disassembly of ribosomal proteins and rRNA. We 
identified seven chaperones, five of which are already annotated 
in previous reports as being targeted to the apicoplast (DnaJ, 
Cpn60 and Cpn20) or mitochondrion (Cpn60 and CpnlO) 
[55,56]. Two other putative chaperones — DnaJ and DnaK — that 
might be apicoplast- and mitochondrion-targeted, respectively, 
were also identified (table 2). In addition to chaperones, RNA 



maturation factors play a vital role in the rRNA modifications 
during ribosome biogenesis. RrmJ, RsmB and KsgA are methyl- 
transferases that methylate specific nucleotides in rRNA during 
their maturation [57-59]. KsgA methylates two adjacent adeno- 
sine residues at the 3' terminal helix of small subunit (SSU) rRNA 
that are two of three nucleotide modifications that are known to 
be conserved in nearly all known ribosomes throughout evol- 
ution [60] with few exceptions [61-64]. Two homologues of 
RlmE/RrmJ were identified in the P. falciparum genome, one of 
which had a predicted mitochondrial targeting signal while 
the location of the other cannot be clearly predicted. Two 
KsgA/RsmA were predicted, one for the mitochondrion and 
the other with possible dual targeting to both organelles. A 
single RsmB with possible dual targeting to the apicoplast and 
mitochondrion was also identified. Further, a homologue of 
the ribosomal maturation factor RimM that is involved in SSU 
biogenesis [65] is predicted for the apicoplast (table 2). 

DEAD-box proteins, which are conserved across bacteria 
and viruses to humans [66], belong to a large family of RNA 
helicases that possess RNA-dependent ATPase activity. They 
act as RNA chaperones, mediate RNA -protein interaction 
and unwind local RNA structures [67-69]. A number of 
putative proteins that may belong to the DEAD-box family 
and are predicted to contain sequence elements for organellar 
import (PF3D7_1445900, PF3D7_02 18400, PF3D7_1 332700, 
PF3D7J418900, PF3D7_0504200, PF3D7_1021500 and 
PF3D7_1251500) were identified. These proteins have a con- 
served DEAD-box motif and RNA helicase domain but could 
not be unambiguously classified as a specific member (SrmB, 
CsdA, DbpA, RhlE or RhlB) of the Escherichia coli DEAD-box 
helicase family [70-74]. 

3.6. Structure modelling of Plasmodium falciparum 
organellar ribosome subunits and drug 
interaction sites 

Several antibiotics, including clindamycin, chloramphenicol 
and the macrolides erythromycin and azithromycin, bind 
in the vicinity of the ribosome LSU peptidyl transferase 
centre or the peptide exit tunnel and inhibit parasite 
growth. This group also includes thiostrepton that contacts 
ribosomal protein Lll and the GTPase region of 23S rRNA 
[75]. Translation inhibitory antibiotics have two putative 
target organelles, the apicoplast and mitochondrion, of the 
parasite. Some antibiotics (e. g. clindamycin, azithromycin, 
chloramphenicol and tetracycline) have been demonstrated 
to have a delayed-death effect, a phenotype associated with 
apicoplast-specific action [76,77]. A single point mutation in 
the LSU rRNA gene of the T. gondii apicoplast confers clinda- 
mycin resistance in vitro [78] and resistance to azithromycin 
in P. falciparum has been attributed to two point mutations: 
one in the P. falciparum apicoplast LSU rRNA and a second 
in the apicoplast-encoded Rpl4 [79]. Thiostrepton causes 
immediate parasite killing and is proposed to have additional 
targets in P. falciparum [80,81]. In order to understand the 
differential interaction of these drugs with apicoplast and 
mitochondrial ribosomes, we carried out in silico modell- 
ing of LSU rRNA and relevant ribosomal proteins (L4, Lll 
and L22) involved in interactions with antibiotics in bacteria. 
This was followed by docking of antibiotics in order to 
estimate their relative specificity for mitochondrial and 
apicoplast ribosomes. 




Figure 2. Structure models of P. falciparum apicoplast (a) and mitochondrial (b) LSU rRNA and proteins L1 1, L4 and L22. The rRNA and protein subunits were 
modelled separately and superimposed on the E. coli ribosome template to generate the ribosome complexes. LSU rRNA is shown in cyan and proteins in red. 



Table 3. Docking scores of antimicrobials in the active site of large ribosomal subunit of E. coli, and P. falciparum apicoplast and mitochondrion. 
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P. falciparum mitochondrial LSU 


E. coli LSU 








dock score 


rmsd 


dock score 


rmsd 


dock score 


rmsd 




antibiotic 


(kcal mol" 1 ) 


(A) 


(kcal mol" 1 ) 


(A) 


(kcal mol" 1 ) 


(A) 


1 


chloramphenicol 


-3.44 


1.31 


-3.19 


1.33 


-3.55 


1.25 


2 


erythromycin 


-13.85 


1.01 


-11.04 


1.61 


-12.2 


0.97 


3 


azithromycin 3 


-21.47 


0.7 


-18.3 


1.78 


-18.64 


0.64 


4 


clindamycin 


-15.97 


1.13 


-14.44 


1.57 


-14.94 


1.07 


5 


thiostrepton b 


-2.69 


3.68 


-1.98 


1.69 


-2.31 


0.68 



a Modelled on the Thermus thermophilus ribosome -azithromycin crystal structure. 
b Modelled on the Deinococcus radiodurans ribosome -thiostrepton crystal structure. 



Prediction of the three-dimensional structures of parasite 
organelle ribosomes is demanding due to the difficulty in 
obtaining high-resolution experimental models. This is further 
complicated by the presence of highly fragmented rRNA 
encoded by the P. falciparum mitochondrial genome [11]. 
A stand-alone version of the RNA prediction tool ModeRNA 
was used for the comparative modelling of rRNA, whereas 
modelling of ribosomal proteins L4, Lll and L22 was 
performed by Modeller v. 9.10. For modelling of the mitochon- 
drial ribosome, different fragments of mitochondrial LSU 
rRNA were aligned manually on the basis of conserved 
secondary structure topology, modelled separately and then 
superimposed together on the E. coli template to obtain a com- 
plex RNA model structure. All the modelled subunits (rRNA 
and protein) were superimposed on the E. coli ribosome tem- 
plate to generate the apicoplast and mitochondrial ribosome 
complexes (figure 2). The fragmented rRNA comprising the 
core of the mitochondrial ribosome is highly reduced, though 
retains conservation of the peptidyl transferase centre and 
the peptide exit tunnel where most antibiotics bind. 

Molecular docking of antibiotics was performed on 
P. falciparum apicoplast and mitochondrial ribosome models 
using Autodock4. Autodock uses grid-based energy evalu- 
ation for docking, where ligands are treated as flexible 



entities by exploring torsional degrees of freedom of ligand 
molecules. The first step of the Autodock algorithm involves 
conformational sampling of ligands followed by prediction 
and ranking of free energy of binding of these conformations. 
One hundred Autodock runs were performed for each inhibi- 
tor. To validate the reproducibility and sensitivity of the 
docking program, Autodock4 was used to dock the inhibitor 
co-complexed with the E. coli template. The inhibitor dock 
scores obtained for apicoplast and mitochondrial ribosomes 
are given in table 3. In the apicoplast, L22 located at the bind- 
ing site for azithromycin [82] contains Arg88 that is predicted 
to form an H-bond with the inhibitor (figure 3a). Arg88 is 
replaced by Gly88 in mitochondrial L22 that does not form an 
H-bond with azithromycin. In addition, the rRNA sequence 
at the binding site also differs at two positions: A2612 and 
A2058 (E. coli number) in the apicoplast are replaced by 
C2612 and U2058, respectively, in the mitochondrion, a 
change that would alter the hydrophobic environment at the 
site. This might explain the differential specificity of azithromy- 
cin for organellar ribosomes. The higher affinity of the antibiotic 
for the apicoplast ribosome is also reflected in the lower dock 
scores obtained for azithromycin and the related macrolide ery- 
thromycin (table 3). Together with the LSU rRNA, L22 and L4 
are predicted to form the peptide exit tunnel on the ribosome. 



(a) 
(i) 



(ii) 




Figure 3. Modelling of antibiotic interactions with P. falciparum organelle ribosomes. (a) Azithromycin docked onto apicoplast (i) and mitochondrial (ii) ribosomes. 
As in the Thermus thermophilus ribosome- azithromycin structure, a single azithromycin molecule was docked at the binding site, (b) Interaction of clindamycin with 
apicoplast (i) and mitochondrial (ii) LSU rRNA. Bases that differ between the apicoplast and mitochondrial rRNA are shown in red and H-bonds as black lines. rRNA is 
in grey, L22 in cyan and antibiotics are in green. 



The G76V mutation of apicoplast L4 has been reported to con- 
tribute to azithromycin resistance in P. falciparum lines [79] and 
modelling on the ribosome -azithromycin structure predicted a 
conformational shift in the side chain of Leu75 of L4 that could 
interfere with the azithromycin binding pocket. However, this 
model was constructed on the Deinococcus radiodurans (an extre- 
mophile bacterium) model that proposed the binding of two 
azithromycin residues at the site, one that interacted with the 
LSU rRNA and the other with L4, L22 and LSU rRNA [83]. 
Structures of the Haloarcula marismortui (an archaeon) and 



Thermus thermophilus large ribosomal subunits complexed 
with azithromycin have since led to the conclusion that a 
single molecule of the antibiotic binds to the ribosome [82]. 
This is supported by biochemical experiments that indicate 
that only one azithromycin molecule is bound to the E. coli 
ribosome [84]. No direct role for L4 in the interaction of azithro- 
mycin with P. falciparum apicoplast and mitochondrial 
ribosomes was detected in our model. 

The only difference in the interaction site for clindamycin 
between the organelle ribosomes was an A2058U (E. coli 




number) transversion in the LSU rRNA of the parasite mito- 
chondrion (figure 3b). This residue forms H-bonds with the 
antibiotic in E. coli [85]. The LSU rRNA residue G2061, whose 
mutation in the apicoplast is associated with clindamycin resist- 
ance in T. gondii [78] and which is critical to the transpeptidation 
reaction, was conserved in the LSU rRNA of both organelles 
in Plasmodium. The binding site for chloramphenicol overlaps 
with that of clindamycin and no obvious differences could 
be detected in chloramphenicol interactions predicted for 
apicoplast and mitochondrial ribosomes; the dock scores for 
chloramphenicol were also comparable for P. falciparum orga- 
nelle ribosomes. However, the in silico approach used by us 
would have inherent weaknesses, and conclusions on actual 
interactions and affinity of these antibiotics for apicoplast/ 
mitochondrial ribosomes awaits experimental validation. 

Thiostrepton targets the GTPase associated centre of the 50S 
ribosome subunit and binds within a cleft between helices 43 
and 44 of the LSU rRNA and Lll. It overlaps with the position 
of domain V of elongation factor G (EF-G), thus perturbing the 
binding of the elongation factor to ribosomes [86]. Plasmodium 
falciparum organelle LSU rRNAs differ at two residues in the 
helices: the crucial A1067 site and A1095 (E. coli number) are 
replaced by G1067 and C1095 in the mitochondrion (figure 4). 
The former has been shown to alter binding of thiostrepton to 
the ribosome although introduction of an A1067G mutation in 
the apicoplast rRNA did not completely abolish in vitro inter- 
action with the antibiotic [87]. It is also important to note the 
low identity and consequent conformational differences in Lll 
of the apicoplast and mitochondrion that might influence 
interaction with thiostrepton. The structural models in figure 4 
as well as the ClustalW alignment of E. coli and P. falciparum 
organelle Lll proteins indicate greater similarity between the 
bacterial and parasite mitochondrial ribosome -thiostrepton 
interaction site compared with the apicoplast [86] (figures 4 
and 5). The identity between the mitochondrial and apicoplast 
Lll with the E. coli protein is 24.71% and 10.07%, respectively. 
In addition to targeting the apicoplast, thiostrepton has also 
been shown to act on the cytosolic proteasome [80] and has 
detectable effects on mitochondrial translation [81]. Thiostrep- 
ton is also able to partially lock P. falciparum mitochondrial 
EF-G onto surrogate E. coli ribosomes, an effect not observed 
with apicoplast EF-G [14]. 



Except for the macrolide antibiotics whose preferential 
interaction with P. falciparum apicoplast ribosomes can 
be explained on the basis of structural differences with 
ribosomes of the mitochondrion, few obvious structural 
explanations can be found for differential drug binding to 
apicoplast and mitochondrial ribosomes by other antibiotics 
tested by us. Apicoplast-specific inhibitory effects that have 
been observed with clindamycin and chloramphenicol may 
thus be due to differential sensitivity attributable to other bio- 
logical factors such as differences in drug accumulation in the 
two organelles or reduced rate of translation in the parasite 
mitochondrion. For thiostrepton, the docking results and 
structural models reported here support earlier biochemical 
data that the antibiotic targets both apicoplast and mitochon- 
drial translation thus mediating early parasite death. 

In conclusion, apicoplast and mitochondrial ribosomes of 
apicomplexan parasites have a unique and reduced compo- 
sition, a fact that would alter the nature of their interactions 
with protein translation factors. This survey is a starting 
point for further functional evaluation of the Plasmodium orga- 
nellar ribosome machinery, its assembly and interactions with 
translation factors and translation inhibitory compounds. 

4. Material and methods 

4.1. Databases and sequence searches 

To identify ribosomal proteins and ribosome assembly factors, 
we searched apicomplexan genomes using the GenBank non- 
redundant nucleotide and CDS translations [88] using tblastn 
and blastp, respectively. We additionally performed direct 
alignments between protein sequences and organellar gen- 
omes using blast2seq. Signal peptide portions of apicoplast 
targeting sequences were sought using SignalP v. 3.0 
[89] and by manual inspection of Kyte Doolitle hydropathy 
plots [90]. Gene models were examined using EuPathDB [91] 
and evidence for transcription start sites and alternative 
splicing based on RNAseq data examined using the GBrowse 
tool [92] implemented at EuPathDB [91]. Putative transit 
peptide portions of apicoplast targeting sequence were manu- 
ally inspected or were detected using the PlasmoAP [93] 
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Figure 5. ClustalW alignment of E. coli L11 with L11 predicted for the P. falciparum mitochondrion (a) and apicoplast (b). 



and PATS [94]. Putative mitochondrial transit peptides were 
manually inspected or were detected using PlasMit [95] or 
MitoProtll [96]. 

Where we identified clear ribosomal proteins that lacked 
clear annotations, or clear organellar trafficking that was not 
included in earlier annotations, we communicated updated 
annotations to curation staff. For high confidence assignments 
the gene names have been changed, for lower confidence assign- 
ments relevant comments have been added to gene pages at the 
EuPathDB [91] and GeneDB genome databases [97]. 

For ribosome assembly /biogenesis proteins, all predic- 
tions were made on the basis of annotations in PlasmoDB 
as well as assignments made by prediction algorithms — 
TargetP, PlasmoAP, PATS, PlasMit and MitoProt II. 



4.2. Molecular modelling 

Prediction of three-dimensional structure of ribosomal struc- 
ture is highly demanding owing to the difficulty in obtaining 
high-resolution experimental models. Present work describes 
the in silico modelling of apicoplast and mitochondrial large 
subunits of 23S ribosome followed by docking studies with 
known inhibitors to understand the comparative basis of speci- 
ficity of these inhibitors. To achieve the modelling of rRNA, an 
RNA prediction tool ModeRNA [98] was used, whereas mod- 
elling of ribosomal proteins L4, Lll and L22 was performed by 
Modeller v. 9.10 [99]. After model building of large subunit of 
23S rRNA, known inhibitors azithromycin, erythromycin, 
clindamycin, chloramphenicol and thiostrepton were docked 
into the peptidyl transferase site of modelled apicoplast and 
mitochondrial ribosome, respectively. 



ModeRNA is a comparative modelling tool of RNA which 
requires a template whose three-dimensional structure is 
known and which shares sequence similarity with the query 
sequence, the one to be modelled and pairwise alignment 
between template and the query sequences [98]. 23S ribosome 
of E. coli (PDB id: 30FC) was chosen as the template to model 
P. falciparum 23S rRNA in apicoplast as well as in mitochondria. 
As the secondary structures have been published for E. coli and 
P. falciparum ribosomes, the alignments were performed manu- 
ally on the basis of conserved secondary structure topology 
to facilitate the modelling of rRNA of apicoplast and mito- 
chondria in P. falciparum. The RNA models were built with 
the stand-alone version of the ModeRNA via a Python scripting 
interface based on the provided alignments. The default 
ModeRNA modelling procedure was followed. As the 
P. falciparum mitochondrial RNA is present in fragmented 
form, different fragments were aligned and modelled separ- 
ately and then superimposed on the template together to 
obtain a complex model structure. Simple geometry checks 
were performed using analyze_geometry function on template 
and target structures using ModeRNA stand-alone version to 
ensure the structural integrity of structure. 

All the protein models were built with Modeller v. 9.10 
based on homologous template structures in E. coli. For each 
case, 10 different models were produced and the one with 
the best DOPE score selected. ClustalW was used for align- 
ment between protein templates and the targets to generate 
comparative models. 

All the modelled subunits of each ribosome including RNA 
and protein were superimposed on template structure and 
merged together to form the complete ribosome. Although 
RNA sequences exhibit divergence, the overall structures 



of modelled ribosomes were found to be well conserved as 
indicated by the secondary structure topology. 

4.3. Molecular docking 

Structures of inhibitors were extracted from the Protein Data 
Bank (PDB) files of large subunit of 70S ribosome co-complexed 
with the respective inhibitors (PDB IDs: chloramphenicol 
(30FC), clindamycin (30FZ), erythromycin (30FR), azithromy- 
cin (30HZ) and thiostrepton (3CF5)). Molecular docking was 
performed on ribosome models used as a receptor to dock our 
inhibitors of interest using Autodock v. 4 [100]. Kollman charges 
were assigned with 40 x 40 x 40 grid points of 0.375 A spacing. 
One hundred Autodock runs were performed for each inhibitor. 



To validate the reproducibility and sensitivity of the dock- 
ing program, Autodock v. 4 was used to dock the inhibitor 
co-complexed with template. The limit of Autodock to read 
maximum atoms of macromolecules was kept constant to 
default and therefore 35 A around the ligands was considered 
only after superimposing the modelled structure on E coll 
ribosome structure. 
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